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MOMENT ESTIMATES IMPLIED BY MODIFIED LOG-SOBOLEV 

INEQUALITIES. 

RADOSLAW ADAMCZAK, WITOLD BEDNORZ, AND PAWEL WOLEE 


Abstract. We study a class of logarithmic Sobolev inequalities with a general form of the 
energy functional. The class generalizes various examples of modified logarithmic Sobolev 
inequalities considered previously in the literature. Refining a method of Aida and Stroock 
for the classical logarithmic Sobolev inequality, we prove that if a measure on R.” satisfies 
a modified logarithmic Sobolev inequality then it satisfies a family of L^-Sobolev-type in¬ 
equalities with non-Euclidean norms of gradients (and dimension-independent constants). 
The latter are shown to yield various concentration-type estimates for deviations of smooth 
(not necessarily Lipschitz) functions and measures of enlargements of sets corresponding 
to non-Euclidean norms. We also prove a two-level concentration result for functions of 
bounded Hessian and measures satisfying the classical logarithmic Sobolev inequality. 


1. Introduction 

Concentration of measure inequalities constitute one of the strongest and most widely used 
tools in modern high dimensional probability, geometry and analysis, crucial in establishing 
e.g. limit theorems or existence proofs by probabilistic method. Their importance was hrst 
noted in the 1970s and since then many powerful approaches have been established, which 
allow to prove concentration results, such as isoperimetric, transportation or functional in¬ 
equalities (we refer to the monograph [SU] by Ledoux for an overview). Among the functional 
inequalities approaches, the two which have proven particularly useful are those based on the 
Poincare and logarithmic Sobolev inequalities. Recall that a Borel probability measure p on 
R”' satishes the Poincare inequality with constant D if 

Var^/ < DE^|V/p 

for all sufficiently smooth functions /: R"" —>■ R, whereas the logarithmic Sobolev inequality 
holds if for all such /, 

Ent^f < DEJV/|^ 

where Ent^p = E^/^ log /^ —E^/^ logE^/^ is the usual entropy of p (throughout the article 
we use the probabilistic notation, treating / as a random variable on the probability space 
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(M", /i), in particular denotes integration with respect to jj). Above and in the 

rest of the paper | ■ | = | • I 2 always denotes the standard Euclidean norm in M"'. 

As is well known, the Poincare inequality yields subexponential concentration of Lipschitz 
functions, whereas the logarithmic Sobolev inequality implies sub-Gaussian estimates. There 
are also other functional inequalities, based either on a modification of the variance functional 
in the Poincare inequality [ini[2S] or a modihcation of the right-hand side in the logarithmic 
Sobolev inequalities da [2ni E], which yield concentration estimates with super-Gaussian 
rates or with rates between subexponential and sub-Gaussian. The general form of such 
log-Sobolev inequalities is 

Ent^f <DE^vI/(^)/2 (1) 

for an appropriate function T: R_|_ —>■ R_|_ (we postpone the introduction of technical condi¬ 
tions on the function T to subsequent sections). 

Together with the discovery of two-level concentration inequalities by Talagrand (initially 
for the exponential distribution [3H]), which improve the estimates based on Poincare inequal¬ 
ity and provide sub-Gaussian estimates for relatively small deviations and subexponential 
bounds for larger ones, a natural question arose whether results of this type could also be 
obtained via functional inequalities. It was soon answered in the affirmative by Bobkov and 
Ledoux [13] who derived new modihed logarithmic Sobolev inequalities, which were subse¬ 
quently extended by Gentil, Guillin and Miclo [201 |2T] to inequalities yielding other (two-level 
and also more general) types of concentration. 

Goncentration estimates are usually derived from modified logarithmic Sobolev inequalities 
via differential inequalities on the log-Laplace transform of the function, a method commonly 
known as the Herbst argument. This method, being very elegant and powerful is however re¬ 
stricted to functions with hnite Laplace transform, such as Lipschitz functions. For functions 
which do not satisfy the Lipschitz condition one can still use a modihcation of the Herbst 
approach, proposed by Aida and Stroock [1]. It relies on the analysis of moments and pro¬ 
vides Lp-type Sobolev inequalities, which can yield concentration, provided that one controls 
the gradient of a function. More precisely, Aida and Stroock proved that the logarithmic 
Sobolev inequality implies inequalities of the form 


||/-Ep/||p<Gz,VP |V/| 


forp > 2, where \\g\\p = {Ep\g\PY^P is the p-th moment of the function g. Glearly, controlling 
all moments of V/ allows then to derive concentration results. 

The aim of this work is to provide a uniform framework which would allow to treat the 
aforementioned modihed logarithmic inequalities and provide concentration estimates with 
general prohles of the deviation bound for functions which are not necessarily Lipschitz. Our 
approach is based on a further rehnement of the method by Aida and Stroock, which gives 
Lp-type Sobolev inequalities with non-Euclidean, p-dependent norms of the gradient on the 
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right-hand side. The form of these norms corresponds to the type of concentration satished 
by the measure. As a particular case we obtain moment inequalities for smooth functions, 
which generalize moment estimates for linear combinations of independent random variables, 
derived by Gluskin and Kwapieh [22] (see Theorem 13.111 below). 

We remark that even in the classical Euclidean framework under defective log-Sobolev 
inequalities, our results improve certain aspects of the work by Aida and Stroock. 

The precise form of the inequalities we obtain depends on the function T in the modihed 
logarithmic Sobolev inequality (IT|) and to describe it we need to introduce some technical 
notation. For this reason we postpone the precise formulation to subsequent sections and 
now we just announce that we will obtain inequalities of the form 


11 / 


< Cd 


|V/| 


’Ll 


for some norm | • 1^,^ on R"" associated with p and the function T. The geometry of the norms 
I ■ |,j,p will be responsible for the character of concentration of measure valid for the measure 
/i. In the classical case, considered by Aida and Stroock, we have simply = y/p\x\. 

As corollaries we obtain concentration results for not necessarily Lipschitz functions as 
well as bounds on the size of enlargement of sets in a setting more general than considered 
before. We also prove some concentration results for Banach space valued polynomial chaos 
in the case of not necessarily product measures, extending previous work by Borell na, 
Arcones-Gine [6], Lochowski |3T] and Adamczak [1]. Additionally we derive comparison 
principles for real-valued polynomials (or more generally functions with bounded derivatives 
of higher order) generalizing previous estimates by Adamczak and Wolff |1] and a two-level 
concentration result for functions with bounded Hessian and measures satisfying the classical 
logarithmic Sobolev inequality. 


Organization of the paper. In Section [2] we introduce the general framework for the 
inequalities we consider. Section |3] is devoted to the presentation of our results. The proofs 
are deferred to Section SJ 


2. Preliminaries 

2.1. Basic notation. We will be working mostly with a hxed probability measure /i, there¬ 
fore we will denote ||/||p = (E^|/|^)^/p, suppressing the dependence on p in the notation. 

Unless otherwise stated, Xj, i = 1, ... ,n, will denote coordinates of a point x G R”, i.e. 
X = (xi,..., Xn)- To distinguish norms on R" from the notation for moments, we will denote 
the former with single bars, e.g. for r > 1, | • |r will stand for the norm, dehned by 
\^\r = (X]r=i ■ Other important norms will be introduced in the sequel. In the case 

of r = 2 we will often suppress the subscript r and write simply | ■ | for 1-12. 
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By C, c we will denote universal constants, whereas the notation C(a) or Ca will be used 
for constants depending only on a parameter a. The values of constants may differ between 
occurrences. 

2.2. Generalized Orlicz functions and modified logarithmic Sobolev inequalities. 

To formulate our results let us hrst introduce the general abstract form of the inequalities we 
will consider. Next we will illustrate it with examples of inequalities known in the literature, 
which £t our framework. 

In what follows we will consider generalized Orlicz functions on M”, satisfying some stan¬ 
dard technical conditions given in the following 

Definition 2.1. We will say that a function T; M"" —>■ M+ U {cxo} satisfies the condition (C) 
if the following holds 
(Cl) ^(O) = 0 and T is continuous at 0, 

(C2) T(a:) > 0 for x ^ 0, 

(C3) lim|j,|^ood^(a;) = oo, 

(C4) for every x G M”, the function 1 1 —)■ \I/(fa;) is left-continuous on (0, cxd), 

(C5) for every x G M”, the function 1 1 —)■ '^{tx)/t is non-decreasing on (0, cxd), 

(C6) T is symmetric, i.e. T(x) = \l/(—x) for all x G M”. 

Consider a probability measure /r on M”', absolutely continuous with respect to the Lebesgue 
measure. The general class of functional inequalities we will consider is described in the 
following dehnition. 

Definition 2.2. Given a function T, satisfying the condition (C) and a positive constant D 
we will say that pi satisfies the modified logarithmic Sobolev inequality mLS'/(\[',D) if for 
every bounded locally Lipschitz function /: M" ^ (0, cxd), 

EnV2<DE^vl/(^)f. (2) 

Note that by the Rademacher theorem, V/ exists pi-a.s. By standard arguments one can 
show that pi satishes mLS'/(T, D) if and only if the above inequality is satisfied by all smooth 
functions of bounded support. 

We will also consider defective versions of the logarithmic Sobolev inequalities. 

Definition 2.3. Given a function T, satisfying the condition (G) and constants D,d > 0 we 
will say that pi satisfies the defective modified logarithmic Sobolev inequality dmLSIifi!, D, d), 
if for every hounded locally Lip schitz function /: M" —>■ (0, oo), 

Ent^/2 < f + dE^f. (3) 

As already mentioned, inequalities of this form have been considered by many authors 
starting from the classical work by Stam 1371, Federbush [TH], Gross 1231 on the logarithmic 
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Sobolev inequality, which in our language corresponds to the choice \I'(a;) = |xp, where | • | 
is the standard Euclidean norm. Modihed versions were hrst considered by Bobkov-Ledoux 
[T5] . then e.g. by Bobkov-Zegarlihski [16], Gentil-Guillin-Miclo [20l|2T], Barthe-Roberto [9| 
and Barthe-Kolesnikov [8]. The defective versions were investigated e.g. by Rothaus [M] . 
Bobkov-Zegarlinski [12], Barthe-Kolesnikov [S], who obtained general criteria under which 
one can infer the non-defective version from the defective one. For instance it is known 
that under some additional conditions on the function T, the defective inequality implies the 
non-defective version if one assumes certain Poincare type inequalities. 

Below we present the best known examples of modihed log-Sobolev inequalities. 

• The inequality (]2]) with T(x) = ||x||'^, where q G (1,2] and || • || is some norm on M” 
was introduced by Bobkov and Ledoux in [TB] . 

• In [13] Bobkov and Ledoux considered \k(x) = H{xi), where 


f for |x| <1/2 
^ oo for |x| > 1/2. 


This inequality was used to recover Talagrand’s concentration inequality for the expo¬ 
nential distribution [3S]- In [2CT1 Gentil, Guilin and Miclo generalized the Bobkov- 
Ledoux inequality, by considering H{x) = -f <h(|x|)l{| 2 ,|>i} for a convex 

function $. When ^{x)/x‘^ is non-decreasing on the positive half-line, the character¬ 
ization of measures on M which satisfy mLSI{H, D) for some hnite D was obtained 
in [g (the characterization of the classical case T (x) = x^ was obtained earlier in the 
seminal paper [T3] by Bobkov-Gotze). 

• The inequalities (]2]) and (]3]) for general T corresponding to a measure ^{dx) = 
on M"", under certain Bakry-Emery type conditions relating T and V were 
studied e.g. by Barthe-Kolesnikov [S], Gentil [13], Shao [36] . 

Since the aforementioned articles introduce many different approaches for proving modihed 
logarithmic Sobolev inequalities and the presentation of all of them is beyond the scope of 
this paper let us only mention that there is a multitude of examples of measures satisfying 
the inequalities in question. Gurrently available tools allow to both hnd mild sufficient 
conditions for a measure to satisfy the inequality (|2]) (resp. (|3])) with a given function T 
or starting from a measure hnd an appropriate T so that the inequality (]3]) (resp. l|3])) 
holds. Moreover, the usual tensorization and perturbation arguments developed for the 
classical logarithmic Sobolev inequality [5] work also in the modihed setting and allow to 
construct further examples. In particular, the one-dimensional characterizations of modihed 
logarithmic Sobolev inequalities allow to consider inequalities on M"" with T(x) = H{xi), 
hrst for product measures and then for their bounded perturbations. 


2.3. Families of Orlicz norms. Let us introduce another notion we need to formulate our 
results, namely a family of (quasi-)norms related to the function T. In the Sobolev type 
inequalities we are about to derive these norms will be applied to the gradient of a function. 
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For p > 0 define 




P 


Assume \1/ satisfies the condition (C). Then so does Tp for all p > 0 and we consider a family 
of (quasi-)norms | • 1 ^^ on M"", dehned as 


= inf{a > 0 : ^p{x/a) < 1 } = inf{a > 0 : ^{px/a) < p}. 


It is easy to see that is indeed a quasi-norm on M”, i.e. |x| 5 ,p = 0 iff x = 0, \tx\^,^ = 
|f| for f G M and \x + y\^,^ < K^^^{\x\^^^ + ll/kp) for some constant K^,^. If T is in addition 
convex, then | ■ |^p is a norm on M"". In what follows we will refer to the functional | • |ii,p as 
the Tp-norm or simply the norm, even if the function Tp is not necessarily convex. Note also 
that from (C5) it follows that the norms | • are non-decreasing in p, i.e. for 0 < p < g and 
any x G M”, 




( 4 ) 


Examples. To demonstrate the reasons for introducing the above abstract dehnition of the 
norms | • l^ip and to show their role in the derivation of concentration of measure results, we 
will now list some families of norms corresponding to special choices of the function T and 
present some special cases of known Sobolev type inequalities. 

(1) Clearly, if T is homogeneous, in particular if it is a norm on M”, then for any p > 0, 
I ■ l^p = T. This shows that to obtain interesting families of norms, | • |^rp, which 
can be used to control the behaviour of moments of random variables, one needs to 
consider functions T which grow faster than linearly. 

(2) If T(x) = ||x||“ for some norm || ■ || on M” and a > 1, then |x|,j,p = p^/"*||x||, where 
a* is the Holder conjugate of a. This simple example can already illustrate the role 
played by the norms | ■ 1^^^ in our estimates. For instance, it is well known [33] that 
for a standard Gaussian measure 7 on M”, for every smooth function /: MP —)■ M and 
every p > 2 , 


||/-E,/||p<Gv^ |V/| 


( 5 ) 


for some absolute constant C (where the moments || ■ ||p are calculated with respect to 
7 ). Note that if | V/| < T on M"', then by applying the Chebyshev inequality in Lp and 
optimizing in p, one recovers (up to constants) the classical Gaussian concentration 
inequality, i.e. 

7(1/ - E^/l >t)< 2exp(-ctV^^) 

for some universal constant c. It is easy to see that for T(x) = |xp, the right hand 
side of (I5]) can be written as G |V/|-i. , so the Sobolev inequality ([5|) is equivalent 
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to 


11/- eTIIp < c |v/|» 


(6) 


(3) Let us now consider an example corresponding to a two-level concentration of measure. 
Let 'l/(a:) = + |a^iri{ki|>i}) for some r G [2, oo). Recall that by | • 1^ 

we denote the norm, i.e. for x = {xi,... ,Xn), \x\r = (X]r=i can show 

that 

- \/p\A 

where ~ denotes two-sided estimates matching up to a universal multiplicative con¬ 
stant. 

It turns out that the expression given above appears in Sobolev inequalities leading 
to two-level tail estimates. Namely, in [3] it is proved that if r G [2, oo) and a measure 
H on M" satishes the mLSI{'^, D), then for all smooth functions / and p> 2, 


11 /-E, 


Ip ^ C/j.£) 


Vp 


|V/h 


+ p 


Ijr* 


|v/|, 




r,D 


|V/| 




(where the moments || • ||p are calculated with respect to //), which implies that for 
any Lipschitz function / and f > 0, we have 


h(l/ - Ep/I >t) < 2 exp 


Cr^D Blin 



(7) 


where a = sup 3 ,g]gn |V/(a;)| 2 , b = sup^jg^n |V/(x)|r. This corresponds to Talagrand’s 
two-level concentration inequality. As we will see, a similar moment bound holds also 
for r G (1,2) and even for more general functions T. We remark that for r G [1,2], 
we have 


xWp - + \/Pl(2;*)r=LpJ+ll2 


,Lp1 


where > ... > x* is the non-increasing rearrangement of the sequence |xi|,..., |x„|. 
(4) Let us remark that in the special case, when /i is a product of measures on R with 
log-concave tails and / is a linear functional, the inequalities of the form 


II/-E/ll, < |V/k, 


have been proved by Gluskin and Kwapieh in [22]. In Section [3.4.21 we will use their 
result (which we recall in Theorem 13.lip to give an interpretation of our results in 
terms of auxiliary i.i.d. sequences. 


3. Main results 


In this section we will present all our results, deferring their proofs to Section 01 
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3.1. Standing assnmptions. Let us first state the main assumptions we are going to use 
throughout the article. 

All the measures we will consider are assumed to be absolutely continuous with respect to 
the Lebesgue measure and this assumption will not be explicitly stated in all the theorems. 

Our usual assumption on the function \b, beside the condition (C), will be following growth 
condition: for some l<a<2</3<cx3 and K >1, 


V, 


a;GR"\{0} Vt>i 


K-H‘ < < A'P. 


{GK,a,p) 


Note that the condition (GK,a, 0 ) is stable under taking max or sum of functions d', and if d/ 
satishes {GK,a,i 3 ) then so does d'p for any p > 0. 


3.2. Sobolev type inequalities. Let us now present our main results, i.e. Sobolev type 
inequalities, which constitute a basis for all the subsequent corollaries. 


3.2.1. The defective case . We will start with moment estimates implied by defective inequal¬ 
ities. Recall the dehnition of the inequality dmLSI{'^, D, d) given in formula (E]). The proofs 
of results of this section are provided in Section 14.11 


Theorem 3.1. Assume that d/; M” —)■ M satisfies the condition (C) and 
l<a<2<(3<oo and K > 1. Let p be a probability measure on M"" satisfying 
dmLSI{'^,D,d). Then for all locally Lipschitz functions f: M” —>■ M and all p > (3, 


{GK,a,y) for some 




+ 


2 e 


a — 1 


|V/| 


( 8 ) 


Let us remark that for any p> (3 and any q G (0, (3) one can actually obtain 

2 e 


{p-VP 9 
p < 2(p-'3)5e^ 




+ 


a 


■ {{kdY/^w{kdY/^) iv/k. 


(9) 


This inequality is a simple consequence of the well-known Lemma 14.11 stated in Section 01 

Note that the constant 2(p-i^^ie obtained with the Lemma ITT] explodes when g —)■ 0 or 

p ^ 13. We do not know if under the assumption of the above theorem, one can prove that 
for all p >2, 

II/IIp<C'(Ao,/3)(||/I|2 + |||V/|j,^ 

Fortunately for the concentration of measure inequalities, for hxed a, (3, it is enough to control 
the growth of ||/ — E/||p for p > (3. Such a bound will be obtained in the non-defective case. 
It would be interesting to know if one can obtain meaningful Sobolev inequalities for the case 
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/3 = oo, which corresponds to the Bobkov-Ledoux inequality (satished e.g. for the product 
exponential distribution). In [3] it has been conjectured that in this case (for d = 0), 


II/-E/||p<Cd |V/b 


+ p 


|V/|. 


and a weaker inequality was proved, with the second term on the right hand side replaced 


by p 


|V/|c 


On the other hand, the reason for excluding the case a = 1 is made clear by the following 
proposition and the example below. 


Proposition 3.2. Assume that 'h: 
and K >1, 


V, 


-)■ 


3;GK"\{0} Vt>i 


satisfies the condition (C) and for some fi > 2 

^{tx) 




< Kt^. 


Let p be a probability measure on R"' satisfying dmLSI ('h, D, d). Then for all locally Lipschitz 
functions /: R” ^ R and all p> fi, 

( 10 ) 


( 11 ) 


< + 61og(p)(r> V (A'C)*-'") |v/|t. 

Example: Let z/ be a probability measure on R with the distribution function 


1 - e —+1 

= <! j 


for a; < 0, 
for a; > 0. 


Proposition 3.3. The measure v defined by CHi satisfies mLS'/('h,2) with \k(a;) = |a;|. 
Also, for every locally Lipschitz function / : R —)■ R and p> 1, 


II/IIlph < II/IImh+ log(p)||/'II lph- 

Moreover, if f{x) = x, then for p > 1, ||/||lp(i/) > (2e)“Mog(p). 


( 12 ) 


The ‘moreover’ part of the above proposition shows in particular that for p ^ oo, the 
log(p) factor in flTOD or (IT^ cannot be improved. 

Further examples: 

(1) If T(a:) = |a:p we are in the setting of the classical defective logarithmic Sobolev 
inequality. A result by Aida-Stroock [3] says that in this case for p >2, 

2 \ 


2 ^ g2d/p* 


p — 


+ D[p-2) |V/| 


(13) 


On the other hand. Theorem 13.11 specialized to this case, asserts that for p> 2, 


p < 


+ 2e^^ |V/| 


(14) 
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Let US note that the constant in front of the term involving V/ in our inequality 
does not depend on d, which is not the case in ([13]). Therefore, even though the 
Aida-Stroock bound may behave in a better way for p close to 2, our estimate (via 
Chebyshev’s inequality optimized over p) shows that the large deviation behaviour of 
functions with polynomial growth of moments of V/ can be controlled independently 
of d, which does not seem to follow from For instance, if II/II 2 < 00 and 


|V/| 


p 


< Ap'^ for some 7 > 0 , then we obtain 


lim sup 

t—>-oo 


1 

t2/(l+27) 


logP(|/| > t) < -Ca,D 


for some (explicit) constant ca,d > 0 . 

We remark that an improvement of the Aida-Stroock result for Lipschitz functions 
was obtained by Rothaus in [35] . 

(2) Consider now T(a;) = + k*ri{hi|>i}) for some r e (l,oo), which 

corresponds to the modihed logarithmic Sobolev inequality introduced in [20] and |H] 
for r > 2 and in [12] for r < 2. In the former case, the inequalities of Theorem 13.11 
read as 


p — 




+ p 


l/r* 


|V/|, 


for p > r (the assumption {GK,a,p) is satished with a = 2, (3 = r and K = 1). We 
remark that if in addition the underlying measure p satishes the Poincare inequality, 
we can replace \\f\\r by ||/||2 and obtain an inequality for any p > 2 (with altered 
constants). 

If r G (1, 2 ), one obtains 


p<G 


+ 


2 e 




+ P 


1/2 


IW/)Lbj+il2 


( 16 ) 


for p > 2 , where dlf{x),. 

I ^f(^) I I 9f(x) I 

quence 
Note that 


.,d^f{x) is the non-increasing rearrangement of the se- 


p 


l/r* 




+ p 


1/2 


i(5;/)r=p,j+ii2 


< Cp^/2 


|V/|, 


so f[T5l) is stronger then the bound flTTl) which has been derived from the classical 
logarithmic Sobolev inequality. Clearly to take advantage of the improvement one 
needs some additional information about the function /. 
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3.2.2. The non-defective case . Let us now pass to our second result, describing the moment 
estimates implied by modified LSI without defect, which will be later applied to obtain 
concentration bounds. Recall the definition of the inequality mLSI{^, D) given in formula 
([2]). The proofs of results presented in this section are deferred to Section 021 
Denote 

L{K,D,a,(d) := (iLD)i/". 

a — 1 \a — 1 / 


Theorem 3.4. Assume that T; M"" M satisfies the condition (C) and {GK,a,i 3 ) for some 
1 < a < 2 < /3 < oo. Let n be a probability measure on M” satisfying mLSIi^^, D). Then 
for all integrable (w.r.t. pi) and locally Lipschitz functions /: M" —)• M and all p> fi, 


\\f-¥.J\\,<CL(K,D,a,l3) |V/|j, 


(16) 


We note that, as is easy to see by truncation arguments, if the right-hand side of flT6|) 
is finite, then the function / is /i-integrable (in fact the p-th moment of / is finite), so the 
integrability assumption is introduced in the above theorem just for formal reasons. 

The advantage of (ITB]) with respect to (0]) is that it provides estimates of central moments 
of / in terms of norms of the gradient, without further dependence on any norms of /. This 
allows to derive concentration property for / based on the regularity of the gradient and as 
a consequence provides also concentration bounds at the level of enlargements of sets. 

Theorem 13.41 is derived from Theorem I3.1l bv means of Proposition 13. SI below. which allows 
to handle the central moment of order fi. 


Proposition 3.5. Under the assumptions of Theorem S.f, for every integrable (w.r.t. pi) 


-)■ 


and locally Lipschitz function f : 

11/ - E„/||j < ctlKDfl^ + (KDiiyA |V/|t, 


(17) 


3.3. Corollaries. Concentration: deviation inequalities and enlargement of sets. 
We will now explain how moment estimates of Theorem 13.41 imply concentration results 
expressed in terms of non-Euclidean norms of the gradient and non-Euclidean enlargements 
of measurable sets. The proofs of results from this section are presented in Section 14.31 
By Chebyshev’s inequality we obtain the following 


Corollary 3.6. Under the assumptions of Theorem \3.Ji\ for all integrable (w.r.t. p.) and 
locally Lipschitz functions f: M” —)■ M and any p> fi, 



The above corollary allows to get concentration bounds if one controls the growth of 


9{p) ■= 


|V/k, 


, no Lipschitz-type conditions need to be assumed. However, since in the 

p 
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simplest situation one may control the growth of g{p) via a uniform bound on |V/|^r^ for 
some a > 0 we will now specialize to functions which satisfy such a bound. To formulate the 
next corollary we will need to introduce the function u<i,: M+ M+ U {c)o}, defined as 




sup 


T(a;) ' 


If T satisfies the condition (C) then oj^ is left-continuous, limt_,.oa;^(t) = 0, limt^oo = 
oo and t i—)■ is non-decreasing, so oj^ is strictly increasing on < cx)}. Consider 

the inverse : M+ —)■ M+ of formally dehned as 

a;^^(s) = sup{f > 0: u^{t) < s}. 


This function is continuous and since t ha is non-decreasing, the function s h-)- 

s/ci;^^(s) is also continuous and non-decreasing. If additionally T satisfies ( GK,a,p) with 
some K > 1 and 1 < a < 2 < ^ then for all t > 0, 


K-\e A t^) < uj^{t) < K{e V t^) 


which implies that 


lim 

t^o 


u^{t) 

t 


0 , 


lim 

>-OD 


t 


= oo 


(18) 


(19) 


and in turn s/c(;^^(s) —)■ 0 as s —)■ 0 and oo as s —)■ oo. Therefore one can dehne 

a function : M+ —)■ M . to be a right-continuous inverse of s h-)- _f i.e. 


cj^(t) = sup |s > 0: < tl. 

I oj^ (s) J 

Note that uj^(t) is strictly increasing. We shall use the following observation (quite standard 
in the theory of Orlicz functions) which shows that the behaviour of the pair of functions cj^ 
and is similar to behaviour of conjugate functions: 


Lemma 3.7. Assume that T satisfies the condition (C) and is such that flT^ holds. Then 
for any t > 0, 

uj^{t) = tsup |n > 0: < t|. (20) 

Moreover, if X{t) = supy^Q{ty — a;^(i/)) is the Legendre transform of uq, then for all t > 0, 

X{t) < ul{t) < X{2t). (21) 


The role played the function in concentration inequalities is revealed by the following 
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Corollary 3.8. Under the assumptions of Theorem \3.4\ if a locally Lipschitz function f: —)■ 

M satisfies |V/(a:)|^^ < b, fi-a.e. for some a,b> 0, then for all t > 0, 

- E,/l > () < <=xp (/? - 


A version of the above corollary was obtained in [9l Proposition 26] in the special case 
when 'h(x) = H{xi) and H is an even convex function on R such that t i—)■ H{t)/U is 

non-decreasing on (0, cxd). Our result and the result of [9] are not directly comparable, on 
the one hand in [9] there is no assumption concerning the parameter ft and the constants are 
explicit, on the other hand the argument there is restricted to functions of a special form 
and to a = 2, i.e. to the case of super-Gaussian tails. We remark that the proof in [9] is 
based on the classical Herbst argument with the Laplace transform. 

We will now express the concentration property of a measure fi satisfying mLSI{'^, D) in 
the language of enlargements of sets. We will do it under an additional assumption that the 
function d' is convex. 


Corollary 3.9. Assume that is convex and let \h* he its Legendre transform. Under the 
assumptions of Theorem S.f, for every Borel set A C R" such that yi{A) >1/2 and every 
u > 0, 

fi(^A + {x e R”: ^*(x) <«})>!- 


Example. Consider \['(x) = + kiri{|xi|>i}) for some r e (l,oo). For 

r < 2, the function d' is not convex, but one can easily see that it is equivalent to a convex 
function, so we can still apply Corollary 13.91 at the cost of adjusting the constants. In this 
case (after replacing by an equivalent function) if one denotes by Bf; the unit ball of if;, 
one obtains that for r > 2, 

P(A + y/flBf + ) > 1 - Cr,De 

whereas for r G (1,2), 

¥{A + iVflBf) n (m^/'-‘5/*)) > 1 - Cr,De Wz.. 

In the case of the product distribution with marginal densities proportional to the 

above inequalities were hrst obtained by Talagrand [32] (for r > 1). The hrst functional 
approach was proposed by Bobkov and Ledoux [13] (r = cx)) and Barthe and Roberto [9] 
(r G [2, oo) as well as more general concentration rates between subexponential and sub- 
Gaussian), who used the modified log-Sobolev inequalities introduced by Gentil, Guillin and 
Miclo. A uniform setting for various types of concentration inequalities, including the ones 
mentioned above was proposed by Gozlan, who used Poincare inequalities with modified 
norms of gradients [33] • There are some subtle differences between the strength of various 
approaches, for instance Gozlan’s approach works also for r = oo and his constants do 









14 


RADOSLAW ADAMCZAK, WITOLD BEDNORZ, AND PAWEL WOLFF 


not depend on r. On the other hand in the non-product case his method introduces some 
dependence on the dimension n (see e.g. Proposition 1.2. in [23]). 

3.4. Further corollaries. Concentration inequalities for polynomials. In this sec¬ 
tion we will present corollaries concerning polynomial like functions. First we will consider 
homogeneous polynomials with coefficients in a Banach space, then arbitrary real valued 
polynomials or more generally functions with bounded derivatives of order k. The proofs of 
presented results are deferred to Section ITTl 

To formulate our results in a concise way we will need to introduce some additional no¬ 
tation. Namely for two fc-indexed matrices A = and B = (&*!,..., 4 )^ 1 ,..., 4 = 1 ) 

where G E for some Banach space E and G M we set 

n 

Moreover, for vectors x^,... ,x’‘ G M"' we dehne ® • • • 0= {x\^ ■ ■ ■ 2 ^ 4 )”!,..., 1^=1 ■ With this 
convention, the F-valued homogeneous form of degree k, given by matrix A as above, i.e. 

n 

'y ' ' y 

can be written simply as {A,x^^). 

By D^/ we will denote the /c-th derivative of a function /: M” —)■ M, which we will identify 
with the corresponding fc-indexed matrix of partial derivatives. 

3.4.1. Concentration for Banach space valued chaos. Let {E, \ ■ 1^;) be a separable Banach 
space and A = a /c-indexed F-valued matrix and X = (Xi,..., X„) a random 

vector in R"'. We will consider the random variable Z = \{A, X^^)\e. Without loss of 
generality we will assume that A is symmetric, i.e. for any permutation a of the set {1, ..., k}, 

Our main result is the following 

Theorem 3.10. Assume that is a convex function satisfying the conditions (C) and 
{GK,a,y) Olid let X be a random vector in MA, whose law is absolutely continuous and satisfies 
the mLSI (T, D). For any p > (d, 

k 

IIZ - EZ||p < ® ••• ® 2/^' 0 (22) 

where A^^p = {x G i?”: T*(x) <p}. As a consequence, for any p> fd, 

k 

w(\Z-¥.Z\>CD,K,aAkY.^ ""^P <e-F 

^ j=i ' 
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Versions of the above theorem were hrst obtained for Ganssian vectors by Borell Id 
and Arcones-Gine [6]. Snbseqnently they were proved for X with independent coordinates 
possessing log-concave tails by Lochowski [3T]) and Adamczak [1]. However, they provided 
rather estimates of \\Z\\p and the deviation of Z above GEZ, then concentration aronnd EZ. 

Let us analyze the quantities appearing on the right-hand side of (12^ . Except for the one 
corresponding to i = k, they are all expectations of suprema of random variables and as such 
are difficult to estimate. The exceptional term however is ‘deterministic’ and it is easy to see 
that for p —)■ cxD it dominates the whole sum. Estimates of this form may be therefore used to 
obtain some large deviation type estimates for |Z —EZ|. Also, in certain situations estimates 
of the troublesome expectations are available. This is the case e.g. if X is a Gaussian vector 
and E is real |28] or more generally E is a Hilbert space (this result is unpublished but it 
may be recovered from estimates in |2H]), and also if E = M, X has independent coordinates 
with log-concave tails and k < 3 [2S1EZ1E]. 

Example. Let us illustrate Theorem 13.101 on a simple example of a real-valued quadratic 
form Z = YL'i ^ centered random vector X = (Xi,...,X„) whose law /i 
satishes mLSI{^, D) with \k(x) = |x|^ for some q G (1,2] (the case studied in [l5l[T6]). We 
have \k*(x) = Gq|x|g*, therefore we obtain 


|Z - EZ||p < sup \ y^ aijUiXj sup ^ aijXii/j 

^ yes-, I 


q* i,j = l 

n n 


i=l j=l 


<?\ lA 


+ p2/'?‘ sup 


q* 2J = 1 


n n 


i=l j=l 


<1\ !/<? 


-I- p^/'^ sup 


' ij=i 


Now, by Proposition 13.51 applied with /9 = 2, for each i (note that T satishes (GK,a,p) with 
K = l,a = q and (3 = 2), 


^1 ^ ^ 

i=i 


i=i 


Thus we obtain 


|Z - EZ\\p < A + 


A — 1^0 

*j=i 


• 19 


lA 


, H = sup ^ aijXiy^ 

x,y£B% ^ 

q* i,j=l 


where 
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As a consequence, for all t > 0, 

P(|Z-EZ| >t) < 2 exp (^-CD,g min (23) 

Clearly Theorem l3.1Ul niav be applied also to quadratic forms or forms of higher order, with 
values in Banach spaces, but the resulting estimates will be then given in terms of expectations 
of suprema, which may not be so easy to estimate. Let us remark that inequalities of the 
form fl2^ with q = 2 are known as Hanson-Wright inequalities. In a slightly weaker form 
they were hrst proven in |25] for quadratic forms in independent sub-Gaussian variables. 


3.4.2. Comparison principles for real-valued polynomials . We will now restrict to a special 
choice of the function T related to the study of moments of linear combinations of i.i.d. 
random variables with logarithmically concave tails. We will start with a brief description of 
the results by Gluskin-Kwapieh [22] . 


Theorem 3.11. Let d); M_|_ —)■ M+ U {cxo} be a convex non-decreasing function, such that 
<h(0) = 0 and <h(l) = 1. Consider a sequence Zi, ..., Zn of independent symmetric random 
variables satisfying P(|Ei| > t) = Define the functions $: M+ —)■ M+ U {oo}, 

I d>(a:) if |a:| > 1 

and 4/: M” —)■ M+ U {c)o}, 

n 

i=l 

where <I>* is the Legendre transform o/<I>. 

Then for every sequence xi,... ,Xn of real numbers and every p >2, 


1 , , 


'^XiZi <C\x\<s,p 


2=1 


We remark that the assumption $(1) = 1 is just a normalization condition which allows to 
obtain two-sided moment estimates with a universal constant C (otherwise one would have 
to replace C by some (explicit) constant C^^,). 

As already mentioned in the introduction, modified log-Sobolev inequalities with the func¬ 
tion 4/ as in Theorem 13.111 were introduced by Gentil-Guillin-Miclo [201 [21] and further 
studied by Barthe-Roberto [9] (when d>(a;)/a;^ is non-increasing, which corresponds to super- 
Gaussian tail behaviour) and Gentil [19] (when ^{x)/x‘^ is non-decreasing, which corresponds 
to sub-Gaussian tail behaviour). 

In view of Theorem 13.111 our Theorem 13.41 can be given an interpretation in terms of 
independent random variables. 
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Corollary 3.12. Under the notation of Theorem \3.11[ further assume that for some K >1 
and l<a<2<(3< oo, 

< Kr\ (24) 

for allt > 1 and u > 0. Assume that a measure /i on R”' satisfies the mLSI{'^, D). Let X he 
a random vector with law fi and a vector Z = (Zi ,..., be a sequence of i.i.d. symmetric 
random variables, independent of X, such that¥(\Zi\ >t) = fort > 0. Then for every 

locally Lipschitz function /: R"" ^ R and every p >2, 

\\f{X) - Ef{X% < C{K,D,a,m{^fiX),Z)\\,. 

The interest in the above reformulation of moment inequalities stems from the fact that it 
can be used as a linearization tool, which allows to get estimates for functions with bounded- 
derivatives of higher order, in particular polynomials. 

Theorem 3.13. In the setting of Corollarv \3.12i let Z^,..., Z^ be independent copies of Z, 
independent of X. Then for every function /: R” —)■ R o/ class and every p>2 we have 

k-l 

||/(A') - E/(A)||p < (|| {D‘/(A), Z' 0... ® lip + ^ || {E_vD7(A), Z' 0 ■ • -0^)11^). 

i=l 

Note that all the terms on the right-hand side, except for the hrst one are moments of 
polynomials in independent random variables. This is also the case for the hrst term, pro¬ 
vided that / itself is a polynomial of degree k. One can thus think of Theorem 13.131 as a 
tool which allows to transfer estimates for polynomials in independent random variables to 
functions with bounded derivatives of higher order of random vectors X, whose law satishes 
mLS'/(T, D). We remark that there are many results concerning polynomials in independent 
random variables with log-concave tails, among available results there are hypercontractive 
estimates, two-sided estimates in terms of expected suprema of certain empirical processes (as 
in Theorem 13. 101) and in some cases (polynomials in Gaussian or exponential variables, poly¬ 
nomials in general variables with log-concave tails of degree at most 3) also precise two-sided 
inequalities in terms of ‘deterministic’ quantities. We do not present the detailed discussion 
here, since it would require introducing rather technical notation and would anyway boil 
down to an application of known estimates. Instead in the example below we work out a 
simple application, again to a quadratic form. 

Example: Let T(x) = Er=i(l^*Pl{h i|<i} + some r > 2 and assume that 

X = (W,... ,Xn) is a random vector whose law satishes mLS'/(\k, H). For simplicity as¬ 
sume further that X is centered. Consider hnally a quadratic form Y = f{X) for f{x) = 
where we assume without loss of generality that Oij = a^j. Thanks to 
centering, we have EV/(W) = 0. Moreover D^/ = {2aij)'fj^i. Therefore, by Theorem 
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13.131 if Zi, ..., is a sequence of i.i.d. symmetric random variables, such that 

mZi\ >t) = exp(-r*), we get 

\\Y-EY\\p<Cn,r 


E 

*j=i 


^ij Zi Zj 


for p > 2. 

Using results from IZ7I, one can hnd a deterministic expression equivalent to the p-th 
moment on the right-hand side above. It is expressed in terms of certain norms of the matrix 
A = treated as a multi-linear functional on products of certain £2 and £r* spaces. 

More precisely, 


where 


P 


1/2 


||^||{i,2}|0 = sup I ^ ttijXij : 5^4' < 1 } = ( 4 

Li=i i,j *>j=i 

n n n 

||^||{i}{2}|0 = supj ^ aijXiyj: < l,^y^ < l}, 

i,j=l i=l j=l 

n n n 

||^||{i}|{2} = supj ^ ttijXiyf. < 1 ,^ < l|. 


Lj=i 

n 


i=l 

n 


i=l 


0|{i,2} = sup I ^ ttijXij: \xiY < 1} = ( 


Hj I 


^J = l 

n 


*,i=i 

n 


*j=i 


l/r 


ii-4||0|{i}{ 2} = sup| ^ ^ i}- 

i,j=l i=l j=l 


As a consequence we obtain that for p> 2, 

||y - EK||„ <CD,,(p‘'"||y4||,i,2,|, +p||y4||„„2,|, +p‘''-||.4||,|„,2, 
+ pl/2+l/r ||.4 ||{i}|{ 2} +p^^’' ||.4||e|{i,{2}^ 
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and so for t > 0, 
p(|F-Ey| >t) 

/ f ^r*/2 

In the class of random vectors satisfying mLSI (\I/, D) this estimate is optimal np to constants 
(as it can be reversed for the vector Y = (Zi, ..., Zn)). A similar derivation may be also 
carried ont for cnbic forms as two-sided estimates of their moments are known [2], however it 
wonld involve 10 different norms of the corresponding 3-indexed matrix (nnder the assnmp- 
tion that X is isotropic). As for forms of higher order, they can also be rednced to forms 
in variables Zi,..., Zn, by means of Theorem 13.131 However hnding two-sided estimates for 
moments of the latter forms remains open. 


< 2 exp I — C£), 


■ mm 


3.5. Concentration resnlts for functions with bounded Hessian under the loga¬ 
rithmic Sobolev inequality. In this section we will consider the setting of the classical 
logarithmic Sobolev ineqnality and we will prove a two-level concentration estimate for fnnc- 
tions with bonnded derivatives of second order, which slightly improves on the special 
case of Theorem 1.2. in [Hj and Theorem 13.131 Onr approach is inspired by a very recent 
development by Bobkov, Chistyakov and Gotze [T2] who considered second order concentra¬ 
tion on the sphere 5”“^. While the anthors of na were interested mostly in snbexponential 
concentration, it tnrns ont that nsing their approach one can also obtain two-level bonnds. 
The goal of this section is to describe this derivation. Actnally, for consistency with previons 
sections, we will consider a slightly more general setting and obtain ineqnalities in terms of 
moments, which allows to obtain concentration for fnnctions with nnbonnded bnt controlled 
Hessian. It will be at a cost of deteriorating constants with respect to what can be obtained 
by working with Laplace transforms in the bonnded Hessian case (as in jl2jL 

Recall that for a matrix A = (Aij)^j^^, by ||A||j :/5 we denote the Hilbert-Shmidt norm of 

A, whereas ||A||op stands for the operator norm of A, i.e. ||A||ji ^5 = 4’ ll^ll°p = 

E n 

The main resnlt of this section is the following 

Theorem 3.14. Let fi be a probability measure on M”, such that for every p > 2 and every 
smooth function f: —)■ R, 


IV/I2 


II/-e„/||p<lvp 


P 


(25) 
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Let f: M” M. be a function of class , such that the operator norm 0/ D^/ is uniformly 
hounded on M”. Then for every t > 0, 

Tilf -^uf\ >t) < e^exp - min 

where 

a^ = Ae^(V2L^ |DV|hs +L\E^Vf\2)\ b = 2eL^ |DV|op • 

In fact we shall prove a more general result, from which the above theorem easily follows. 


Theorem 3.15. Let fv be as in Theorem 3. If Then for every k >2, every f: ML 
class and for every p >2, 


\\f-E,f\\p<L^E\Vf\2 + L^p |DV| 


op 


p 

k-l 


R of 

(26) 


< y/p 




IDVI2 + 2^^-^'>/^L^\E,B^f\2) + L^p iDViop 


m=l 


where for an m-indexed matrix A = denote l/lja = y Eri,...,*^=i 

and I • lop is the operator norm of a (two-indexed) matrix. 

The advantage of Theorem 13.141 over the case of Theorem 1.2. in [3] stems from the 
fact that in the latter instead of |||D^/|hs||2 one has || |D^/|hs|Ip- As a consequence, the tail 
bound obtained in [3] uses || |D^/|hs||oo instead of || |D^/|hs||2- On the other hand it is not 
clear to us whether Theorem 13.151 could lead to similar improvements of the results in [3] 
in the case of functions with bounded derivatives of order higher than 2, since instead of 
the term |||ll^/|op||p the bounds in [3] involve |E^D^/|op (at the cost of introducing some 
additional norms of higher order derivatives). We refer the Reader to [3] for the details. 


4. Proofs 

In the proofs we will drop the subscript /i and write simply E, Ent for E^, Ent^. 

4.1. Proofs of results from Section 13.2.11 Let us hrst state without proof the following 
well-known lemma, which follows from the convexity of the function p ha log ||X||i/p. 

Lemma 4.1. If X is a random variable, such that for some p > g > 0 and ^4 > 1, ||X||p < 
A||Xllg, then for all 0 < r < q, 

\\X\\,<A^\\X\\r. 

We are also going to use the following observation on the norms | ■ 
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Lemma 4.2. Ifsatisfies (C) and {GK,a,/ 3 ) then for any x G M"', 

4'(x) < K{\x\% + |x|§). 

Proof. First note that (C) implies that \I/(a;/|a;|$) < 1. If |x|^ > 1, then 


= 4^ ( lx 


X 


^ I 


F k 


< K\x\i, 


and if |x|^ < 1 , then 


1 > d' 


X 


F k 




□ 


We are now ready to prove Theorem 13.11 

Proof of Theorem \3 . 1[ Consider an arbitrary locally Lipschitz bounded positive function /: M"’ —)■ 
M. Arguing as in the proof of Theorem 3.4. in [3], we get 

= l(E/‘)*-'Ent/‘. 

Thus by duiLSI^'II, D,d) applied to the function f^G and by Lemma applied to Tp we 
have 


d 

dt 


(E/T" ^ t§- (E/k'‘E/‘>I'(^) + 


tVf\ 2d 


2 / 




2Dp 
^2 lit 

ol-a^n Or! 

j 5Z^II/IIL‘E/‘-“|V/IL + J5r^ll/lir‘E/‘-^|V/l|. + ^ll/ll?. 

Further denote M = KD. Using Holder’s inequality with pairs of exponents v and * 


< 


t—OL ’ a. 


for t G {fi,p) we have 


t-/3’ 


dt 


t< 


< 


2 i-“M 


2^~hM 

ni/iiniiv/kji?+^,z^ 


2-i|| f\d-P 


2d 


ll/lini|V/kJ|f + ^,„,n 


t‘2-ap 

For f G [fi,p] dehne 


pi-apa-l 


^(«) = ll/ll?/ |v/k, 
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(note that by Condition (C), if the denominator above vanishes, then / is constant and the 
theorem is trivially satisfied). Clearly x is non-decreasing and in the view of the above it 
satishes 


^ < Ma{t)x^ -I- Mb{t)x^ -1- dc{t)x 

(JjL 


(27) 


for t G where 


a{t) = b{t) = c{t) = 

Now, consider three cases: 

Case 1: x{p) < V In this case we simply have 

ll/llp<^(A/‘'“VA/''''’) |V/|j,, . 

CK — i p 

Case 2: a:(/3) > V It is easy to check that for t = /3 we have 


(28) 


and since x{t) is non-decreasing, we clearly have the above for all t G {(3,p). Combining (l27|l 
with ([28]) yields 

Cl IT 

— < + M^/%t))x^/^ + dc{t)x. 

Substituting y = x^!'^ we get 


from which we easily obtain 


1 rv 


< y{^)eP + 


'P 

d 1 


+ 




2 Va - 1 2/3-i(/3 - 1) 


e/3 


< ym)ei + -(M'/“ V M'/^)el 

a — 1 


If d < /3 then the above yields 


which means 


yip) < yil3)e^^^ +-(Mi/“ V M^/^), 

a — 1 


<e^/^\\f\\y + ^iM^/-yM^/f^) |V/k, 
a — 1 
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and ii d> (3 then nsing y{l3) > V 

y{p) < 2y{P)e^lf^ < 

hence 

\\f\\p<e^'^\\fh. 

Case 3: x(to) = V for some to E {(3,p). Argning as in Case 2, for y = 

we have 


yip) < y{to)e‘^ ^^0^ -\ — ( / = ho g 2 /to 


< yito)e^^'^~p^ H-V 

a — 1 ^ ' 


(29) 


Case 3a: to > Then A — 1 < 1 and nsing the fact that yito) = V the 

ineqnality fl2^ gives 

yip) < 2e^^^(M^/“ V 


hence 


p<2e-- 

a — 1 




Case 3b: to < Again, nsing the fact that yito) = the ineqnality 

implies 

yip) < yito)e^^'^~p^ H-V 

a — 1 


which means 

ll/llp<^ll/k, 

with A = e ^*0 p . Using Lemma [4. II with q = to and r = (3 we obtain 

(p-/^)*n n j P-*0 (p-^)*0 

ll/llp < tlIP=W||/||^ = e w n'=^||/||,j 

This ends the proof for bonnded positive fnnctions. Let ns now assnme that /: M"" —)■ M 
is a bonnded locally Lipschitz fnnction. Set Qm = |/| + 1/m for m = 1, 2,... and note that 
almost everywhere with respect to the Lebesgue measure, / and all the functions are 
differentiable, moreover Vgmix) 7 ^ 0 implies that V/(x) = Vgmix). Thus the inequality for 
/ follows by a limiting argument. Removing the boundedness assumption is straightforward 
by a truncation argument. □ 


Let us now pass to the sketch of the proof of Proposition 13.21 
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Proof of Proposition \3.S{ It is enough to follow the steps of Theorem 13.11 and to replace the 
splitting value V for x{t) with V {KDY^^^ilogpY. □ 

We will now provide the proof of Proposition 13.31 which shows that the lower bound in 
our assumption {GK,a, 0 ) cannot be avoided. 

Proof of Proposition \3. tA Let us note that for \b(a;) = |x|, mLSI{'^,2) is equivalent to that 
for all bounded, locally Lipschitz functions / > 0, 

EnW<E,|f|. (30) 

For a Borel set ACM, denote = liminfg^o '^A+i-£,s))-pA) ^ ^ general fact, 

based on the co-area formula, that the isoperimetric inequality 


z/(A) log V (1 — p(A)) log 


u{A) 

valid for all Borel sets ACM implies 
formula for the entropy. 




( 31 ) 


1 - f{A) 

. Indeed, on the one hand, using the variational 


Entj,/ = sup < fgdiy-.g-.'K^Wis measurable, bounded and / e^dz/ < 1 [, (32) 


and the Fubini theorem, for any measurable and bounded : M —)• M with f do < 1 and p 
dehned as a hnite (signed) measure on M such that dp = gdo we obtain 


fgdu= / fdp= / p({f>t})dt = 


< 


Ent^(l{/>q) dt= z/({/ > f}) log 


l{/(A>dd(^) dt 
1 


where the inequality follows from 

Entj./ < 


. Hence, 

> O)log 






dt. 


dt, 


On the other hand, by the co-area formula (see e.g. Theorem 8.5.1. in [7]), 

POO 

K\f\> o+{{f>t})dt 

Jo 

which combined with the previous formula shows the implication fl3T]) 

For the isoperimetric inequality fl3T|) itself, since the density of o w.r.t. the Lebesgue 
measure, 

{ t_p —(e —(1—a:)) for x 0 

le-W-(i+A)^ for a; > 0 

is log-concave, the result of Bobkov m asserts that it is enough to check 0311) for half-lines, 
and in fact, by symmetry of o, for A = [x, oo) with a: > 0. 
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Let t G (0,1/2], Then F^^{1 — t) = log(l + log and thus 


^(1 -^)) = ^(1 + log^) > ^log^ = ^log^ V (1 -t) log 

which proves the isoperimetric inequality fl3T|) . 

For the moment estimate (IT^ one can repeat the argument from the proof of Theorem 13.11 
to get that the function x{t) as defined therein satishes 

dx 2 1 , 

H 2 ' W’ 

for t E (l,p), i.e. j and thus 

x^^‘^{p) — < logp, 

which implies flT^ . In order to show the ‘moreover’ part of the proposition note that 


X 


LP{u) 


= e 


px^ dx^ > (^ep j (-logp)^ 
= (ep(^logp)^e"P) 


p—1 —glogp 


dx 


i/p 


^ 1 ^ 
“ 2e 


□ 


4.2. Proofs of results from Section 13. 2. 21 Let us first prove the Poincare type inequality 
given in Proposition 13.51 

Proof of Proposition 13.51 The proof follows the approach by Bobkov and Zegarlihski |T6] who 
considered the function \h(a;) = \x\^ for q E [1,2]. Let Mf be the median of / under /x. We 
have 11/ — IE/II /3 < 2||/ — M/||y 3 so it is enough to prove ffT7|l with the mean replaced by the 
median. In what follows without loss of generality we will assume that Mf = 0. 

Note that for any bounded, locally Lipschitz function g \ M”' —)■ (0,cxd), the inequality 
mLSI{^, D) applied to and Lemma [4.21 yield 

Ent/ < DE/vI/(^) = D/3E/vI/^(^) 

< KDft(2-f^E\Vg\i^ + 2--Egf^--\Vg\%^). 

If a < /9 we apply the Young inequality to the last expectation to get 
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and the choice A = (2^ ^KD{I3 — a)) 


(/3-a)//3 


yields 


Ent/ < KD/3(2-^ + 2-^^{2^-^KD{/3 - a)) 

\ p 


{P-a)/a 


ma\l, + 2%' 


< C((KD)'"‘ + (/3A'C)‘''“)'’e|V9|J, + \e,/. 


( 33 ) 


Note that (l3^ is obviously valid also in the case a = f3. Also, this inequality can be extended 
to arbitrary non-negative locally Lipschitz function g: —)• M. Moreover, by Lemma 2.2 

in [16] , for any non-negative h: M"" —)• R, 

1 


Enth > ( log 


Eh, 


p{h > 0})- 

which used for h = and combined with fl3^ gives that for any non-negative locally 
Lipschitz function g : 


-)■ 


log 


i)E/ < cUkdP^ + pKDf/^XnVg\l. 


/i({h>0}) 2 ) " ^ ^ (34) 

Now, applying flMl) to the functions g = and g = f- and using the assumption Mf = 0 
and the implication V/-i- 7 ^ 0 V/± = ±V/, we obtain 


(log2 - 1/2)E/^ < c(^{KDpf^ + E|V/+|J^ 

< c({KDp^ + pKDp^^YElVfli^ 


and 

(log2 - l/2)E/f < c(^{KDY/f^ + 

< c(^{KDY/^ + {/3KDy/<^yE\Vf\i^. 

Summing the above inequalities we get 

^El/l' < 2C((A'C)'"’ + (fJKD)pE\Vff^^, 
which ends the proof. □ 

Having proven Proposition 13.51 we can reduce Theorem 13.41 to Theorem 13.11 


Proof of Theorem \3.4\ We apply Theorem 13.II to the function \f — E/| to get for p > (3, 

11 / - E/llp < 11/ - E/ll^ + + (KDpP |V/| j 

a — 1 V / 
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Thus by Proposition 13.51 
11/ - E/llp < c((KDy'l^ + (KDPf'-) |V/|» 


and Holder’s inequality we obtain 

C 


+ 

/3 a 


(KDY/^^ + iKDy/n |v/|v, 


< c 


a 


-{KDY^^ + 


a 


+ |V/| 


□ 


4.3. Proofs of results from Section 13.31 Let us start with the proof of Lemma [3.71 


Proof of Lemma 3.7 The dehnition of can be written equivalently as 

uj^{t) = tsup {m > 0 : > m}. 


Now recall that if /; M+ —)■ M+ is left-continuous, non-decreasing and satishes hma,^o fi^) = 
0 , lima,^oo f{x) = oo and g: M+ —>■ M+ is a right-continuous inverse of /, i.e. 


g{y) = sup{x > 0: f{x) < y}, 


then for all x,y > 0, 


giy) > X 


fix) < y. 


(35) 


Applying fl5BD with as / we obtain that > u if and only if u^iu) < tu, which 

proves (120]). 

For the hrst inequality in fl2T]) . £x t > 0 and take n > 0 for which the supremum in fl20l) 
is attained (such u exists due to ffl^ and left-continuity of o;^). Then for all y > u, 

y ~ y-u 

hence for all // > u, 

ty — uj^iiy) < tu. 

Moreover the above inequality holds trivially for y e (0, u]. Therefore A(t) <tu = u^it). 

To prove the second inequality of fl^ £x any u > 0 satisfying uj^iu)/u < t and note that 


tu < 2tu — uqiiu) < sup(2t// — a;^(//)) = A(2t). 

y>0 


□ 


Next, we will prove Corollary 13.81 

Proof of Corollarv \3.^ Denote L := CL{K, D, a, (3). In view of Corollary 13.61 it is enough to 
show that for all > 0, 

p < au:%it/{Lh)) 


|V/(i)|t, < tjL, Ii-a..e. 


(36) 
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Then for a given t > 0 take p = auj^{t/{Lb)) and use Corollary 13.61 and fl36D to obtain 
/i(|/ - E/I > t) < /i(|/ - E/I > L|||V/k,|P 
< < ef^-P = 

Note that the hypothesis |V/(a;)|^^ < 6, /x-a.e. implies that for any > 0, 
'L Vf{x) \ ^ l^^/ pLVf{x) \ ^ al^^/ aVf{x) \_ ^Lpb\ 
t ) V \ t ) ~ p a \ b ) at ) 




P 

^ a / Lpb 

^ ( 


p 


at 


From fl20|) in Lemma 13.71 we have 


fLbp\ p 
\ t a/ a 


w.i 


Lb 


— ? 
a 


thus (I5B]) follows. 

Before we prove Corollary 13.91 let us formulate a simple lemma. 


□ 


Lemma 4.3. Under the assumptions of Corollary \3fM we have 

h(l/ - M^f \ >t^< 2exp (^(3 - 

where L = CL{K,D,a, (3) and C is the eonstant from Corollarv \3.^ 

Proof. Since is increasing and right-continuous, we can take to to be a smallest positive 
real satisfying 

£+log2_ 

\Lb/ a 

or equivalently, exp [/3 — auj^{t/{Lb)) <1/2. Then bv Corollary 13. 8 1 /i(|/ —E/| > to) < 1/2 
and thus \Mf — E/| < to. Therefore, using Corollary 13.81 for t > 2to, 

/i(|/-M/| >t) </i(^|/-E/| > <exp 

On the other hand if t < 2to then by the dehnition of to, 

2exp(/?-au,;(^))>l, 

SO the inequality of the lemma holds trivially. □ 

Proof of Corollarv \3.tA We recall that if 4) is a Young function on M", $* is the Legendre 
transform of $ and | • |^ denotes the norm on M", dual to | • |$, then for all a: G M", 

|x|$* < |a;|J < 2|a;|$*. (37) 
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Using the lower bound from {GK,afi)i one can show that \b*(a;) = 0 iff a: = 0 and < cx) 

at all X. Further, notice that {x\ \b*(x) < u} = B{\ ■ 1) := {x: < 1}. Dehne 

the function 


f{x) = inf \x - 2 /|„-ivi/* 

y£A 


,^- 1 ^*, which implies that V/ exists 


The function / is 1-Lipschitz with respect to the norm 
almost everywhere and < 1. Since = (T„)*, (]37|l implies that {V< 1. 

Therefore, by Lemma [4.31 applied with a = u, b = 1, t = 1, 

h(/ > ^/ + l) < 2 exp - uul{l/L)^, 

where L = CL{K, D, a, (3). By the assumption ^{A) > 1/2, we have Mf = 0 and so 

Al + {vl/*(x) <u} = {f <l} = {f <Mf + 1}, 
therefore the above inequality yields 

+ {T*(x) < > 1 — 2 exp (^(3 — 

It remains to bound Ci;^(l/L) from below. Note that by the convexity of T and fllSp . the 
function Uiy is convex and everywhere hnite, hence continuous. Thus by Lemma 13.71 

ujI{ 1/L) =w/L, (38) 

where w is such that Uys,{w)/w = 1/L. On the other hand, the upper bound in flT5]l yields 

1 Uxs,{w) 

L ^ 

hence 


w 


w > 


A 


< K{w°‘ ^ V ^), 
1 1 


> 


(XL)V(a-i) (iFL)V(/3-i) - (iLL)V(a-i) 
Combining the above lower bound on w with fl38|l we obtain 

uU^/L) > /\ i-\ 


A 1. 


□ 


4.4. Proofs of results from Section 13.4L Let us start with the proofs of results concerning 
Banach space valued homogeneous polynomials. 

Proof of Theorem \3 .1 (A We will proceed by induction on k. Note that the norm | ■ can 
be expressed as a supremum of countably many functionals, therefore it is enough to prove 
the theorem for {E, 1-1) = (f^, | • |oo) with arbitrarily large, hnite N. We will denote the 
index related to the structure in the superscript, i.e. ... ,a(^ and 

v = K..4-1' 

For k = 1, the inequality in question is then just a dual formulation of Theorem 13.41 for 
/(x) = maXr<Ar I Y^=i Indeed, it is enough to note that A^,^p is the unit ball for the 
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Orlicz norm corresponding to the function which up to universal multiplicative constants 
is equivalent to the dual norm for | ■ 

Let us thus assume that the theorem is true for all homogeneous forms of degree strictly 
smaller than k. We can assume that the real-valued matrices are pairwise distinct 

and nonzero for 1 < r < iV. Since the set of zeros of a non-trivial polynomial is Lebesgue 
null, there exist open sets r = e = ±1, such that M” \ (ljr=i Ue=±i-®r) is 

Lebesgue null and on B^, 

\{A,x^’^)\E = e{A^,x^’^). 

Denoting [k] = k}, we get that almost everywhere on B^, 

n k 

Vf{x)=e(^ 

= l s=l i<u<k 

u^s 

k 

l<n<k ^ 

U^S 

n ^ 

= fe( ^ 

*2vi*fc = l 


where in the last equality we used the symmetry of the coefficients Denoting Br = 

Bl U B~^ we see that on 5^, we have 


f{.x)\^,^<Ck sup 


n 


i: 

A -1 




= Ck sup ^i)|. 

p 


Thus almost surely 


|V/(X)|g, < EA; max sup |(ZL,|/ 

l<r<Ny^A^^ 


The right-hand side above is a supremum of homogeneous forms of degree k—1 in X, moreover 
by separability of A^, p it can be clearly approximated by suprema of a hnite number of such 
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forms. Therefore, by the induction assumption, 


|V/(X)|^^ 


max sup \{A\y 


CD,K,a,l3,k-l 

i=2 


max sup 

l<r<Ny^A^^ yT 


sup I {A% ® | 


=CD,K,a,y,k'^^ sup max ® 1 /^ ® • • • (g)i/^ (g) 

This ends the proof of ([22]), since by Theorem 13.41 


Z-EZ 


< CD,K,a,/3 


|V/(X)|^^ 


The second estimate of Theorem I3.1UI follows now by the Chebyshev inequality. 


□ 


We will now pass to the proof of Theorem 13.131 Let us start with the main tool, which is 
Corollary 13.121 


Proof of Corollarv \3.1A Relating the Legendre transform of $ to the conjugation in a sense 
of Lemma 13.71 one can deduce that the growth condition 0241) on $ implies that satisfies 


k'-H^ < 


!>*(«) 


< K't^ 


for alH > 1 and m > 0, where K' = C{K,a, /3). As a consequence, T satisfies {GK',a,y) and 
therefore, for p > 13 the corollary is a direct consequence of Theorem 13.41 and Theorem 13.111 
For 2 < p < (3 we use the fact that if p satishes mLSI{'^, D), then it also satishes the 
Poincare inequality 

Var(/) < ConVfl^ 

(see [21]), which as is well known (see e.g. [16] or [32]) implies that 


||/-E/||,<C> |V/| 


Moreover, due to the normalization $(1) = 1 one can easily get |a;| < C\x\q,^ for p > 1, which 
allows to deduce the corollary for 2 < p < /9. □ 


Proof of Theorem \3.1A Given Corollary 13. 121 the proof follows with just formal changes the 
proof of Proposition 3.2. in [3]. □ 
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4.5. Proofs of results from Section 13.51 In this section we wil present the proof of 
Theorem 13.151 Theorem 13.141 will then follow by specializing to A; = 2. 

Let us start with the following simple lemma. 


Lemma 4.4. In the setting of Theorem \3.14\ fof every m < k, 

|D”^/-ED™/|2 < \/2L |D”"+V|; 

2 

Proof. We will regard D^f as a vector in ~ R"'"‘. Let X be a random vector 

distributed according to fi and G a standard Gaussian vector in R"-"*, independent of X. 
Then 


|D™/-ED”^/|^ 


' = Ex|D™/(X) - ExB^f{X)\l = EcExiB^fiX) - ExB^f{X), G)^ 
2 

<2L2EgEx|D(D™/(X),G)|2 


where the second equality follows from the Fubini theorem and the inequality from the 
assumption fl2^ . applied conditionally on G to the function x i—)■ (D™/(a;),G). Now, it is 
easy to see that 

Eg|d(d”^/(x),g)|2 = |d™+VWIL 

which ends the proof. □ 


Corollary 4.5. In the setting of Theorem\3 . 14\ for all k>2, 


k-l 


iv/b <(^/2L)'=-l |DV|2 


2 - 


m=l 


Proof. An induction on k, using Lemma [4.41 


□ 


Proof of Theorem A3.15\. By (l25|l . we have 


\\f-E,f\\,<L^ iV/h 


(39) 


It is easy to prove that |V |2 is locally Lipschitz and IVIV/I 2 I 2 < iDViop /i-a.s. Indeed, 
we have for x G R” and \h\ —>-0, by the triangle inequality and Taylor’s expansion, 

|V/(x + ft)|2 - \VfO)\2 KELi + 0(|ft|))?.il2 

- < - - 


\h\ 


\h\ 

< +^( 1 ) < +o{i), 


which via standard compactness arguments yields that IV/I 2 is locally Lipschitz and that if 
V|V/(a;)|2 exists (which happens p-a.s.), then its Euclidean norm does not exceed |D^/(x)|op. 
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Thus, another application of (1^ gives 


IV/I2 <EJV/|2+ IV/I2 


E;.|V/|2 <EJV/|2 + Tvp IDVI 


lop 


p 


which together with fl3^ gives the hrst inequality of fl26l) . The second inequality follows now 
by Corollary 14.51 □ 
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